Abstract: Simultaneous Localization and Mapping (SLAM) is a pivotal technology in computer vision and robotics, enabling autonomous machines to navigate and incrementally map unknown environments. It simultaneously tracks the machine's location and orientation while mapping the surroundings, a dual challenge that has seen considerable evolution over the past two decades. This project employs a 3D Gaussian Splatting (3DGS)-based SLAM algorithm to achieve dense mapping of environments. In the future, we plan to use handheld devices equipped with multiple sensors to perform multi-sensor fusion for data collection and scene reconstruction in indoor environments.
Background
Building on the original 3D Gaussian Splatting, methods such as SplaTAM[1] and Gaussian Splatting SLAM/MonoGS[2] are among the latest 3DGS-based SLAM algorithms. These methods usually involve gradient descent, a set of 3D Gaussians, and a strategy for dynamically adding and deleting Gaussians. Almost all of them use the concept of keyframes and include a color error and a depth error in the loss. However, the keyframe selection strategy differs slightly. When optimizing the Gaussians, a small number of keyframes are selected to participate in the calculation of the color error. In Gaussian Splatting SLAM, the current keyframe window and two randomly selected previous frames are used. In SplaTAM, the current frame, the previous frame, and some previous keyframes that have high overlap with the current frame are selected.

3D Gaussian splatting SLAM
First, a set of 3D Gaussians is initialized from the input sparse point cloud. Next, together with the SfM-calibrated cameras, the 3D Gaussians are projected onto the 2D plane, allowing fast α-blending for rendering. The core of the pipeline is optimization: it adaptively creates a dense set of 3D Gaussians and optimizes their position, opacity, color, and covariance. These optimized parameters are reprojected onto the 2D plane, ultimately achieving high-fidelity reproduction of the image.

MonoGS
First, tracking estimates and optimizes the camera pose by minimizing residuals against the images observed by the camera. Next, keyframes are repeatedly selected and filtered during 3D reconstruction, and new Gaussians are inserted into the scene at each keyframe. Finally, mapping maintains the coherence of the 3D scene and optimizes the newly inserted Gaussians. In addition, two old keyframes are randomly selected in each iteration to ensure high fidelity of the map.

Code Reproduction
The following definitions underlie the reproduced code, leading to the generated simulation results:
• Definition of a 3D Gaussian:
  $$G(x) = e^{-\frac{1}{2} x^{\top} \Sigma^{-1} x}$$
• The covariance of the projection in 2D:
  $$\Sigma' = J W \Sigma W^{\top} J^{\top}$$
• In tracking:
  The photometric residual is:
  $$E_{pho} = \lVert I(\mathcal{G}, T_{CW}) - \bar{I} \rVert_1$$
  The geometric residual is:
  $$E_{geo} = \lVert D(\mathcal{G}, T_{CW}) - \bar{D} \rVert_1$$
  The pixel depth is:
  $$D_p = \sum_{i \in N} z_i \alpha_i \prod_{j=1}^{i-1} (1 - \alpha_j)$$
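The photometric residual, the geometric residual, and the pixel depth used in tracking can be illustrated with a minimal pure-Python sketch. It is not the MonoGS implementation: the function names are hypothetical, images are treated as flat lists of numbers, Gaussians along a ray are assumed to be sorted near to far, and the weighting parameter `lambda_pho` (with its default value) is an assumption of this example.

```python
def composite_depth(depths, alphas):
    """Alpha-composite per-Gaussian depths along one pixel ray.

    Computes D_p = sum_i z_i * alpha_i * prod_{j<i} (1 - alpha_j),
    assuming the inputs are ordered from nearest to farthest.
    """
    depth = 0.0
    transmittance = 1.0  # product of (1 - alpha_j) over Gaussians already passed
    for z, alpha in zip(depths, alphas):
        depth += z * alpha * transmittance
        transmittance *= 1.0 - alpha
    return depth


def l1_residual(rendered, observed):
    """L1 norm of the difference between a rendered and an observed image."""
    if len(rendered) != len(observed):
        raise ValueError("rendered and observed must have the same size")
    return sum(abs(r - o) for r, o in zip(rendered, observed))


def tracking_loss(rgb_rendered, rgb_observed, depth_rendered, depth_observed,
                  lambda_pho=0.9):
    """Weighted sum of photometric and geometric residuals.

    lambda_pho is a hypothetical weighting chosen for this sketch.
    """
    e_pho = l1_residual(rgb_rendered, rgb_observed)
    e_geo = l1_residual(depth_rendered, depth_observed)
    return lambda_pho * e_pho + (1.0 - lambda_pho) * e_geo
```

In a real system these quantities are computed by a differentiable rasterizer so that gradients can flow back to the camera pose; the sketch only shows the arithmetic.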


• In mapping:
  The isotropic regularization is:
• Covariance of optimized 3D Gaussian:
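For reference, the forms usually given in the 3DGS and MonoGS literature for these two quantities are $\Sigma = R S S^{\top} R^{\top}$ (rotation $R$, diagonal scale $S$) and $E_{iso} = \sum_i \lVert s_i - \tilde{s}_i \cdot \mathbf{1} \rVert_1$ (with $\tilde{s}_i$ the mean of the scale vector $s_i$); it is an assumption that the poster uses exactly these forms. The pure-Python sketch below illustrates them together with the 2D projection $\Sigma' = J W \Sigma W^{\top} J^{\top}$. All function names are hypothetical, and the pinhole Jacobian with focal lengths `fx`, `fy` is a simplifying assumption.

```python
import math


def matmul(a, b):
    """Multiply two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(row) for row in zip(*a)]


def quat_to_rot(w, x, y, z):
    """Rotation matrix from a quaternion (normalized inside the function)."""
    n = math.sqrt(w * w + x * x + y * y + z * z)
    w, x, y, z = w / n, x / n, y / n, z / n
    return [
        [1 - 2 * (y * y + z * z), 2 * (x * y - w * z), 2 * (x * z + w * y)],
        [2 * (x * y + w * z), 1 - 2 * (x * x + z * z), 2 * (y * z - w * x)],
        [2 * (x * z - w * y), 2 * (y * z + w * x), 1 - 2 * (x * x + y * y)],
    ]


def covariance_3d(quat, scale):
    """Sigma = R S S^T R^T, which is positive semi-definite by construction."""
    r = quat_to_rot(*quat)
    s = [[scale[i] if i == j else 0.0 for j in range(3)] for i in range(3)]
    rs = matmul(r, s)
    return matmul(rs, transpose(rs))


def projection_jacobian(point_cam, fx, fy):
    """Jacobian of a pinhole projection at a camera-frame point (x, y, z)."""
    x, y, z = point_cam
    return [[fx / z, 0.0, -fx * x / (z * z)],
            [0.0, fy / z, -fy * y / (z * z)]]


def covariance_2d(cov3d, w_rot, jac):
    """Sigma' = J W Sigma W^T J^T, with W the world-to-camera rotation."""
    jw = matmul(jac, w_rot)
    return matmul(matmul(jw, cov3d), transpose(jw))


def isotropic_loss(scales):
    """Sum over Gaussians of the L1 deviation of each scale from its mean."""
    total = 0.0
    for s in scales:
        mean = sum(s) / len(s)
        total += sum(abs(v - mean) for v in s)
    return total
```

Parameterizing the covariance through a rotation and a scale keeps it valid during gradient descent, and the isotropic term discourages highly elongated Gaussians.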


Aim
• Distinguish the differences between different Gaussian splatting methods.
• Simulate 3D Gaussian Splatting and MonoGS.
• Use handheld devices equipped with multiple sensors for multi-sensor fusion in indoor environments for data collection and scene reconstruction.
• Algorithm implementation and optimization.

Figure.1. Optimization starts with the sparse SfM point cloud and creates a set of 3D Gaussians[3]. (Diagram labels: SfM Points, 3D Gaussians, Density Control, Tile Rasterizer, Operation Flow, Gradient Flow.)
Figure.2. SLAM System Overview[2]. Our SLAM system uses 3D Gaussians for tracking, mapping, keyframe management, and novel view synthesis. (Diagram labels: Input Color/Depth, Mapping, Window Optimisation.)
Figure.5. Keyframing of MonoGS[3].
Figure.6. Simulation result of MonoGS[3].
ATE RMSE: 0.013208777237770622
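The ATE RMSE reported with the simulation result is the root mean square of the translational error between estimated and ground-truth camera positions. A minimal sketch of that computation follows; it assumes the two trajectories are already time-associated and aligned to a common frame (evaluation tools normally perform this alignment first), and the function name is hypothetical. The unit of the reported value is not stated in the source.

```python
import math


def ate_rmse(estimated, ground_truth):
    """Root mean square of per-pose translational errors.

    estimated, ground_truth: sequences of (x, y, z) positions, assumed to be
    matched pose by pose and expressed in the same coordinate frame.
    """
    if not estimated or len(estimated) != len(ground_truth):
        raise ValueError("trajectories must be non-empty and of equal length")
    squared_errors = [
        sum((e - g) ** 2 for e, g in zip(p_est, p_gt))
        for p_est, p_gt in zip(estimated, ground_truth)
    ]
    return math.sqrt(sum(squared_errors) / len(squared_errors))
```

For example, two poses with translational errors of 0 and 2 give an RMSE of sqrt((0 + 4) / 2), i.e. the square root of 2.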
We have completed the software installation for Ubuntu, configured ROS, CUDA, conda, and other environments, and studied the SLAM algorithm in depth.
In the simulation phase, SLAM greatly improves the fidelity and diversity of captured object materials through efficient texture mapping. However, Gaussian functions cannot explicitly represent surfaces, so loop closure and the extraction of geometric shapes in large-scale scenes merit future research. Gaussian splatting technology has shown outstanding performance in real-time rendering and dynamic scene representation, providing effective solutions for computer-generated imagery, virtual/augmented reality, robotics, film animation, retail, environmental research, and aerospace. The technology still faces some challenges, such as computational complexity, memory usage, edge artifacts, and the trade-off between real-time performance and accuracy.
Finally, our handheld device has been successfully built and tested. Its edge computing function enables more convenient data collection and provides real-time output of the current environment reconstruction. The integration of multiple sensors can further enhance the robustness of the 3DGS algorithm and improve the visualization of data from other sources, which is worthy of further investigation.